Stability of Bivariate GWAS Biomarker Detection
نویسندگان
چکیده
Given the difficulty and effort required to confirm candidate causal SNPs detected in genome-wide association studies (GWAS), there is no practical way to definitively filter false positives. Recent advances in algorithmics and statistics have enabled repeated exhaustive search for bivariate features in a practical amount of time using standard computational resources, allowing us to use cross-validation to evaluate the stability. We performed 10 trials of 2-fold cross-validation of exhaustive bivariate analysis on seven Wellcome-Trust Case-Control Consortium GWAS datasets, comparing the traditional [Formula: see text] test for association, the high-performance GBOOST method and the recently proposed GSS statistic (Available at http://bioinformatics.research.nicta.com.au/software/gwis/). We use Spearman's correlation to measure the similarity between the folds of cross validation. To compare incomplete lists of ranks we propose an extension to Spearman's correlation. The extension allows us to consider a natural threshold for feature selection where the correlation is zero. This is the first reported cross-validation study of exhaustive bivariate GWAS feature selection. We found that stability between ranked lists from different cross-validation folds was higher for GSS in the majority of diseases. A thorough analysis of the correlation between SNP-frequency and univariate [Formula: see text] score demonstrated that the [Formula: see text] test for association is highly confounded by main effects: SNPs with high univariate significance replicably dominate the ranked results. We show that removal of the univariately significant SNPs improves [Formula: see text] replicability but risks filtering pairs involving SNPs with univariate effects. We empirically confirm that the stability of GSS and GBOOST were not affected by removal of univariately significant SNPs. These results suggest that the GSS and GBOOST tests are successfully targeting bivariate association with phenotype and that GSS is able to reliably detect a larger set of SNP-pairs than GBOOST in the majority of the data we analysed. However, the [Formula: see text] test for association was confounded by main effects.
منابع مشابه
lodGWAS: a software package for genome-wide association analysis of biomarkers with a limit of detection
UNLABELLED Genome-wide association study (GWAS) of a biomarker is complicated when the assay procedure of the biomarker is restricted by a Limit of Detection (LOD). Those observations falling outside the LOD cannot be simply discarded, but should be included into the analysis by applying an appropriate statistical method. However, the problem of LOD in GWAS analysis of such biomarkers is usuall...
متن کاملTargeted detection of the cancer cells using the anti-CD24 bio modified PEGylated gold nanoparticles: the application of CD24 as a vital cancer biomarker
Objective(s): The central role of molecular imaging modalities in cancer management is an undeniable fact that could help to diagnose cancer tumors in early stages. The main aim of this study is to prepare a novel targeted molecular imaging nanoprobe of CD24-PEGylated Au NPs to improve the ability of Computed tomography scanning (CT scan) outputs for both in vitro and in vivo detection of breas...
متن کاملThe Nature of Genetic Variation for Complex Traits Revealed by GWAS and Regional Heritability Mapping Analyses.
We use computer simulations to investigate the amount of genetic variation for complex traits that can be revealed by single-SNP genome-wide association studies (GWAS) or regional heritability mapping (RHM) analyses based on full genome sequence data or SNP chips. We model a large population subject to mutation, recombination, selection, and drift, assuming a pleiotropic model of mutations samp...
متن کاملMetrics and Acquisition Modes for Fixation Stability as a Visual Function Biomarker.
Purpose To compare different metrics and acquisition modes of fixation stability as a new visual function biomarker in a large cohort of patients with ABCA4-related Stargardt disease from the multicenter prospective ProgStar study. Methods Fixation was tested during a separate fixation exam and also dynamically during a sensitivity exam, using fundus-tracking microperimetry (Nidek MP-1). Fixa...
متن کاملA rapid, early detection of oral squamous cell carcinoma: Real time PCR based detection of tetranectin
The current study is focused on determining the mRNA expression levels of tetranectin, to detect oral squamous cell carcinoma (OSCC) and thus aiding in its classification at an early stage. RNA was isolated and cDNA synthesis was performed from the saliva samples of the patients and healthy individuals. A semiquantitative PCR based analysis was performed prior to quantitative and expression bas...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 9 شماره
صفحات -
تاریخ انتشار 2014